TNL: NUMERICAL LIBRARY FOR MODERN PARALLEL ARCHITECTURES
Abstract
We present the Template Numerical Library (TNL, www.tnl-project.org) with native support of modern parallel architectures like multi-core CPUs and GPUs. The library offers an abstract layer for accessing these architectures via a unified interface tailored for easy and fast development of high-performance algorithms and numerical solvers. The library is written in C++ and it benefits from template meta-programming techniques. In this paper, we present the most important data structures and algorithms in TNL together with scalability on multi-core CPUs and speed-up on GPUs supporting CUDA.
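The idea of a unified interface selected through C++ templates can be illustrated with a minimal sketch. This is not TNL's actual API: the tags `Sequential` and `Blocked`, the `ParallelFor` struct, and the `axpy` function are hypothetical names chosen for illustration, and `Blocked` is only a stand-in for the place where a multi-core or CUDA backend would launch its threads.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical device tags (illustrative only, not taken from TNL).
struct Sequential {};
struct Blocked {};  // stand-in for a real parallel backend

// Primary template: the loop implementation is chosen at compile time
// from the Device tag.
template <typename Device>
struct ParallelFor;

// Plain serial loop.
template <>
struct ParallelFor<Sequential> {
    template <typename F>
    static void exec(std::size_t begin, std::size_t end, F f) {
        for (std::size_t i = begin; i < end; ++i) f(i);
    }
};

// Walks the range in fixed-size blocks. A real backend would hand each
// block to a CPU thread or a GPU thread block instead of looping here.
template <>
struct ParallelFor<Blocked> {
    template <typename F>
    static void exec(std::size_t begin, std::size_t end, F f) {
        constexpr std::size_t block = 4;
        for (std::size_t b = begin; b < end; b += block) {
            const std::size_t stop = (b + block < end) ? b + block : end;
            for (std::size_t i = b; i < stop; ++i) f(i);
        }
    }
};

// Algorithm written once against the unified interface: y = a * x + y.
template <typename Device>
void axpy(double a, const std::vector<double>& x, std::vector<double>& y) {
    assert(x.size() == y.size());
    ParallelFor<Device>::exec(0, x.size(),
                              [&](std::size_t i) { y[i] += a * x[i]; });
}
```

Calling `axpy<Sequential>(2.0, x, y)` and `axpy<Blocked>(2.0, x, y)` runs the same algorithm body through different backends, with the choice resolved at compile time and no virtual dispatch.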
Similar resources
A numerical differentiation library exploiting parallel architectures
Article history: Received 7 July 2008 Received in revised form 15 January 2009 Accepted 10 February 2009 Available online xxxx PACS: 02.60.Jh 02.60.Pn 02.70.Bf
A Parallel Numerical Library for UPC
Unified Parallel C (UPC) is a Partitioned Global Address Space (PGAS) language that exhibits high performance and portability on a broad class of shared and distributed memory parallel architectures. This paper describes the design and implementation of a parallel numerical library for UPC built on top of the sequential BLAS routines. The developed library exploits the particularities of the PG...
NDL-v2.0: A new version of the numerical differentiation library for parallel architectures
We present a new version of the numerical differentiation library (NDL) used for the numerical estimation of first and second order partial derivatives of a function by finite differencing. In this version we have restructured the serial implementation of the code so as to achieve optimal task-based parallelization. The pure shared-memory parallelization of the library has been based on the ligh...
Parleda: a Library for Parallel Processing in Computational Geometry Applications
ParLeda is a software library that provides the basic primitives needed for parallel implementation of computational geometry applications. It can also be used in implementing a parallel application that uses geometric data structures. The parallel model that we use is based on a new heterogeneous parallel model named HBSP, which is based on BSP and is introduced here. ParLeda uses two main lib...
Efficiency improvements of iterative numerical algorithms on modern architectures
For many numerical codes the transport of data from main memory to the registers is commonly considered to be the main limiting factor to achieve high performance on present micro architectures. This fact is referred to as the memory wall. A lot of research is targeting this point on different levels. This covers for example code transformations and architecture aware data structures to achieve...
Journal
Journal title: Acta Polytechnica
Year: 2021
ISSN: 1210-2709, 1805-2363
DOI: https://doi.org/10.14311/ap.2021.61.0122